1 -oo 



Ig PATENT APPLICATION 

i"* IN THE UNITED STATES PATENT AND TRADEMARK OFFICE 

\ o Docket No: 28049/36241 

PATENT APPLICATION TRANSMITTAL UNDER 37 C.F.R. 1.53 

Box Patent Application o ^ 



Assistant Commissioner for Patents 
Washington, D.C. 20231 



Sir: '^"rr^^ 



Transmitted herewith for filing is the patent application of o 



Inventor(s): Venugopal SRINIVASAN 

Title: MULTI-BAND SPECTRAL AUDIO ENCODING 

1 . Type of AppI ication 

IS This is a new application for a 
^ utility patent. 
□ design patent. 

□ This is a continuation-in-part application of prior application no. 

2. Application Papers Enclosed 

1 Title Page 

46 Pages of Specification (excluding Claims, Abstract, Drawings & Sequence Listing) 

14 Page{s) of Clainns 

1 Page(s) of Abstract 

3 Sheet(s) of Drawings (Figs. 1 to 3) 

□ Formal 

m Informai 

CERTIFICATION UNDER 37 CFR 1.10 

I hereby certify that this Patent Application Transmittal and the documents referred to as enclosed 
therewith are being deposited with the United States Postal Service on April 6, 2000, in an envelope 
addressed to the Assistant Commissioner for Patents, Washington, D.C. 20231 uti izing the Express 
Mail Post Office to Addressee" service of the United States Postal Service under Mailing Label No. 
EM362728865US. 

rTchard ZIMM^RMANN 




Declaration or Oath 



s Enclosed 

13 Executed by (check all applicable boxes) 
IS !nventor(s) 

□ Legal representative of inventor(s) 
(37 CFR1.42 or 1.43) 

□ Joint inventor or person showing a proprietary interest on behalf of 
inventor who refused to sign or cannot be reached 

□ The petition required by 37 CFR 1 .47 and the statement required 
by 37 CFR 1 .47 are enclosed. See Item 6D below for fee. 

□ Not enclosed - the undersigned attomey or agent is authorized to file this application 
on behalf of the applicant(s). An executed declaration will follow. 

Additional Papers Enclosed 

□ Preliminary Amendment 

□ Information Disclosure Statement 

□ Declaration of Biological Deposit 

□ Computer readable copy of sequence listing containing nucleotide and/or amino 
acid sequence 

□ Microfiche computer program 

□ Verified statement(s) claiming small entity status under 37 CFR 1 .9 and 1 .27 

□ Associate Power of Attorney 

□ Verified translation of a non-English patent application 

□ An assignment of the invention 
^ Return receipt postcard 

□ Other 



5. Priority Applications Under 35 USC 119 

Certified copies of applications from which priority under 35 USC 119 is claimed are listed 



below and 

□ are attached. 

□ will follow. 





APPLICATION NO. 


FILED 








j COUNTRY 













6. Filing Fee Calculation {37 CFR 1.16) 
A. ^ Utility Application 



CLAIMS AS FILED - INCLUDING PRELIMINARY AMEND 


WENT (IF ANY) I 




SMALL ENTITY 


OTHER TH 
EN 


AN A SMALL 
TITY 




NO. FILED 


NO. EXTRA 


RATE 


FEE 


RATE 


FEE 


BASIC FEE 








$345.00 




$690.00 


TOTAL 


33 -20 


= 13 


X 9 = 


$ 


X18 = 


$234.00 


INDEP. 


10 -3 


= 7 


X39 = 


$ 


X78 = 


$546.00 


n First Presenta 


tion of Multiple Dependent Claim 


+ 130 = 


s 


+ 260 = 


$0 


Filing Fee: 


$ 


OR 


$1,470.00 



B. □ Design Application ($155.00/$310.00) Filing Fee: $ 

C. □ Plant Application ($240.00/$480.00) Filing Fee: $ 

D. Other Fees 

□ Recording Assignment [Fee - $40.00 per assignment] 

□ Petition fee for filing by other than all the inventors 

or person on behalf of the inventor where inventor refused 
to sign or cannot be reached [Fee - $130,00] 

□ Other 



Total Fees Enclosed $ 1.470,00 



7. 



Method of Payment of Fees 



Enclosed check in the amount of: 



$ 1.470.00 



□ 



Charge Deposit Account No. 13-2855 in the amount of: 
A copy of this Transmittal is enclosed. 



□ 



Not enclosed 



Deposit Account and Refund Authorization 



The Commissioner is hereby authorized to charge any deficiency In the amount enclosed or any 
additional fees which may be required during the pendency of this application under 37 CFR 
1 .16 or 37 CFR 1 .17 or under other applicable rules (except payment of issue fees), to Deposit 
Account No. 13-2855. A copy of this Transmittal is enclosed. 

Please refund any overpayment to Marshall, OToole, Gerstein, Murray & Borun at the address 



Please direct all future communications to TREVOR B. JOIKE. at the address below. 



below. 



Respectfully submitted, 




MARSHALL, OTOOLE, GERSTEIN, 
MURRAY & BORUN 
6300 Sears Tower 
233 South Wacker Drive 
Chicago, Illinois 60606-6402 
(312)474-6300 
(312) 474-0448 (Telefacsimile) 



April 6, 2000 



SOLE INVENTOR 

"EXPRESS MAIL" mailing label No. EM362728865US. 
Date of Deposit: April 6, 2000 

1 hereby certify that this paper (or fee) is being deposited with 
the United States Postal Service "EXPRESS MAIL POST 
OFFICE TO ADDRESSEE" service under 37 CFR §1 .10 on the 
date indicated above and is addressed to: Assistant 
Commissioner for Patents, Washington, D.C. 20231 





Richard Zimmermann 



APPLICATION FOR 
UNITED STATES LETTERS PATENT 



SPECIFICATION 



TO ALL WHOM IT MAY CONCERN: 

Be it known that I, Venugopal Srinivasan, a citizen India, residing at 
2845 Jarvis Circle, Palm Harbor, 34683, in the County of Pinellas and State of Florida 
have invented a new and useful MULTI-BAND SPECTRAL AUDIO ENCODING, of 
which the following is a specification. 



MULTI-BAND SPECTRAL AUDIO ENCODING 



Related Application 

This application contains disclosure similar to the 
disclosure in U.S. Patent Application Serial No. 09/116,397 filed 
5 July 16, 1998, in U.S. Patent Application Serial No. 09/427,970 

filed October 27, 1999, and in U.S. Patent Application Serial No. 
09/428,425 filed October 27, 1999. 

Technical Field of the Invention 
h3 The present invention relates to a system and method 

1G= for adding an inaudible code to an audio signal and for subse- 
quently retrieving that code. Such a code may be used, for 
^=-3 example, in an audience measurement application in order to 
^=3 identify a broadcast program. 

H Background of the Invention 

15 There are many arrangements for adding an ancillary 

code to a signal in such a way that the added code is not no- 
ticed. For example, it is well known in television broadcasting 
that ancillary codes can be hidden in non-viewable portions of 
video by inserting the codes into either the video's vertical 

20 blanking interval or the video's horizontal retrace interval. Ai 
exemplary system that hides codes in non-viewable portions of 
video is referred to as "AMOL" and is taught in U.S. Patent No. 
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4,025,851. This system is used by the assignee of the present 
application in order to monitor broadcasts of television program- 
ming as well as the times of such broadcasts. 

Other known video encoding systems have sought to bury 

5 ancillary codes in a portion of a television signal's transmis- 
sion bandwidth that otherwise carries little signal energy. 
Dougherty in U.S. Patent No. 5,629,739, which is assigned to the 

i;3 assignee of the present application, discloses an example of such 

Ul a system. 

iQi It is also known to add ancillary codes to audio 

Co signals for the purpose of identifying the signals and, perhaps, 
^= for tracing their courses through signal distribution chains. 

Audio encoding has the obvious advantage of being applicable not 
Cn only to television, but also to radio broadcasts and to pre- 
iP recorded music. Moreover, the speaker of a receiver reproduces, 
in the audio signal output, the ancillary codes that are added to 
audio signals. Accordingly, audio encoding offers the possibil- 
ity of non- intrusive interception (i.e., interception of the 
codes without intrusion into the interior of the receiver) and of 
20 decoding the codes with equipment that has microphones as inputs. 
Moreover, audio encoding permits the measurement of broadcast 
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audiences by the use of portable metering equipment carried by 
panelists . 

In the field of audio signal encoding for broadcast 
audience measurement purposes, Crosby, in U.S. Patent No. 
5 3,845,391, teaches an audio encoding approach in which the code 

is inserted in a narrow frequency "notch" from which the original 
audio signal is deleted. The notch is made at a fixed predeter- 
U mined frequency (e.g., 40 Hz). This approach leads to codes that 
in are audible when the original audio signal containing the code is 
1^1 of low intensity. 
CO A series of improvements followed the Crosby patent. 

Thus, Howard, in U.S. Patent No, 4,703,476, teaches the use of 
=;r two separate notch frequencies for the mark and the space por- 
V} tions of a code signal. Kramer, in U.S. Patent No. 4,931,871 and 
iP in U.S. Patent No. 4,945,412 teaches, inter alia, using a code 

signal having an amplitude that tracks the amplitude of the audio 
signal to which the code is added. 

Broadcast audience measurement systems in which panel- 
ists are expected to carry microphone -equipped audio monitoring 
20 devices that can pick up and store inaudible codes broadcast in 
an audio signal are also known. For example, Aijalla et al . , in 
WO 94/11989 and in U.S. Patent No. 5,579,124, describe an ar- 
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rangement in which spread spectrum techniques are used to add a 
code to an audio signal. The code is either not perceptible, or 
can be heard only as low level "static" noise. 

Also, Jensen et al . , in U.S. Patent No. 5,450,490, 
5 teach an arrangement for adding a code at a fixed set of frequen- 
cies and using one of two masking signals. The choice of masking 
signal is made on the basis of a frequency analysis of the audio 
□ signal to which the code is to be added. Jensen et al . do not 
In teach arrangements for selecting a maximum acceptable code energy 
to be used in each of a predetermined set of frequency intervals, 
^=8 nor do Jensen et al . teach energy exchange coding which transfers 
energy between spectral components and which thereby holds the 
total acoustic energy constant. 
m Preuss et al., in U.S. Patent No. 5,319,735, teach a 

multi-band audio encoding arrangement in which a spread spectrum 
code is inserted in recorded music at a fixed ratio to the input 
signal intensity (code-to-music ratio) that is preferably 19 dB. 
Lee et al., in U.S. Patent No. 5,687,191, teach an audio coding 
arrangement suitable for use with digitized audio signals. The 
20 code intensity is made to match the input signal by calculating a 
signal-to-mask ratio in each of several frequency bands and by 
then inserting the code at an intensity that is a predetermined 
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ratio of the audio input in that band. Lee et al . has also 
described a method of embedding digital information in a digital 
waveform in U.S. Patent No. 5,824,3 60. 

Jensen et al . , in U.S. Patent No. 5,764,763, teach a 
5 method in which code signals consisting of sinusoidal waves at 
ten pre-selected frequencies in a high resolution spectrum are 
added to the original audio in order to represent either a binary 
□ bit (0 or 1) and the start and end of an embedded message. Forty 
unique frequencies are required for encoding these four symbols. 
l6J Their values range from 1046.9 Hz to 2851.6 Hz in a typical 
^=0 practical embodiment. The frequency separation between adjacent 
1. lines in the spectrum is 4 Hz and the minimum separation between 

frequencies selected to constitute the set of 4 0 frequencies is 8 
^=0 Hz. The amplitude of the injected code signal is controlled by a 
1^' masking analysis. In the decoding process, the injected code 
signal is distinguished by the fact that its level will be 
significantly above a noise level computed for a band of frequen- 
cies . 

It will be recognized that, because ancillary codes are 
20 preferably inserted at low intensities in order to prevent the 
codes from distracting a listener of program audio, such codes 
may be vulnerable to various signal processing operations as well 

-5 - 
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as to interference from extraneous electromagnetic sources. For 
example, although Lee et al . discuss digitized audio signals, 
many of the earlier known approaches to encoding a broadcast 
audio signal are not compatible with current and proposed digital 
audio standards, particularly those employing signal compression 
methods that may reduce the signal's dynamic range (and thereby 
delete a low level code) or that otherwise may damage an ancil- 
lary code. In this regard, it is particularly important for an 
ancillary code to survive compression and subsequent de-compres- 
sion by the AC- 3 algorithm or by one of the algorithms recom- 
mended in the ISO/IEC 11172 MPEG standard, which is expected to 
be widely used in future digital television broadcasting systems. 

U.S. Patent Application Serial No. 09/116,397 filed 
July 16, 1998 and U.S. Patent Application Serial No. 09/428,425 
filed October 27, 1999 disclose a system and method for inserting 
a code into an audio signal so that the code is likely to survive 
compression and decompression as required by current and proposed 
digital audio standards. Spectral modulation of the amplitude or 
phase of the signal at selected code frequencies is used to 
insert the code into the audio signal. These selected code 
frequencies, which could comprise multiple frequency sets within 
a given audio block, may be varied from audio block to audio 
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block, and the spectral modulation may be implemented as ampli- 
tude modulation, modulation by frequency swapping, phase modula- 
tion, and/or odd/even index modulation. Moreover, an approach is 
taught to measuring audio quality of each block and of suspending 
encoding in cases where the code might be audible to a listener. 

In experimental systems of the sort taught in the '3 97 
application and in the M25 application, the audio sampling 
process during encoding imposes a delay in excess of twenty 
milliseconds in the audio portion of a television program. Left 
uncorrected, this delay results in a perceptible loss of synchro- 
nization between the audio and video portions of a viewed pro- 
gram. Hence, practical systems of this sort have required the 
use of a compensating video delay circuit. However, it is 
preferable to do without such a circuit. 

Moreover, in systems of the sort taught in the '3 97 
application and in the '425 application, codes are added by 
manipulating pairs of frequencies that are spaced apart by about 
100 Hz. These systems are thus vulnerable to interference, such 
as reverberation or multi-path distortion, that affect one of the 
encoded frequencies substantially more than the other. 

The present invention is arranged to solve one or more 
of the above noted problems. 
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Summary of the Tnvention 

According to one aspect of the present invention, a 
system for adding an interference-resistant, inaudible code to an 
audio signal comprises a sampler, a processor, a frequency 
transformation, a frequency selector, and an encoder. The 
sampler is arranged to sample the audio signal at a sampling rate 
and to generate therefrom a plurality of short blocks of sampled 
audio, where each of the short blocks has a duration less than a 
minimum audibly perceivable signal delay. The processor is 
arranged to combine the plurality of short blocks into a long 
block having a predetermined minimum duration. The frequency 
transformation is arranged to transform the long block into a 
frequency domain signal comprising a plurality of independently 
modulatable frequency indices, where a frequency difference 
between two adjacent ones of the indices is determined by the 
minimum duration and the sampling rate. The frequency selector 
is arranged to select a neighborhood of frequency indices so that 
the frequency difference between a lowest index and a highest 
index within the neighborhood is less than a predetermined value. 
The encoder is arranged to modulate two or more of the indices in 
the neighborhood so as to make a selected one of the indices an 
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extremum while keeping the total energy of the neighborhood 
constant . 

According to another aspect of the present invention, a 
method is provided to add a code to a frequency band of a sampled 
5 audio portion of a composite signal without thereby introducing a 
perceptible delay between the encoded audio portion and another 
portion of the composite signal. The method comprises the steps 
Cj of: a) selecting a sampling rate and a frequency difference 
in between adjacent ones of a predetermined number of frequency 
im indices included in a frequency neighborhood; b) determining 
C-0 from the sampling rate and from the frequency difference a 
duration of a block of samples; c) determining an integral 
number of sequential sub-blocks to make up the block, where the 
integral number is selected so that each of the sub-blocks has a 
iP sub-block duration less than the perceptible delay; d) process- 
ing the block so as to modulate a selected one of the frequency 
indices without changing a total signal energy of the band. 

According to still another aspect of the present 
invention, an apparatus is provided to read a code from an audio 
20 signal. The code comprises a sequence of blocks having a prede- 
termined number of samples of the audio signal, and the code 
comprises a synchronization block followed by a predetermined 
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nuinber of data blocks. The apparatus comprises a buffer memory, 
a frequency transformation, a processor, and a vote determiner. 
The buffer memory is arranged to hold one of the blocks. The 
frequency transformation is arranged to transform the one block 
into spectral data spanning a predetermined number of frequency 
bands, where each of the frequency bands comprises a respective 
neighborhood of frequency indices. The processor is arranged to 
determine, for each of the neighborhoods, if a respective prede- 
termined one of the frequency indices is modulated. The vote 
determiner is arranged to determine that the one block is the 
synchronization block if, in a majority of the frequency bands, 
the respective modulated frequency index is a respective index 
selected for inclusion in the synchronization block. The proces- 
sor is further arranged to determine if, in one of the data 
blocks received subsequent to the synchronization block, a 
respective predetermined one of the frequency indices is modu- 
lated. The vote determiner is further arranged to determine if, 
in a majority of the frequency bands, the respective modulated 
frequency index is a respective index selected for inclusion in 
the one data block. 

According to yet another aspect of the present inven- 
tion, a method is provided to read a code from an audio signal by 
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sequentially transforming a sequence of blocks of audio samples 
into spectral data spanning a predetermined number of frequency 
bands. Each of the frequency bands comprises a predetermined 
number of frequency indices, and each of the blocks comprises a 
predetermined number of the samples. The code comprises a 
synchronization block followed by a predetermined number of data 
blocks. The method comprises the steps of: a) determining, in 
each of the frequency bands of one of the blocks of audio sam- 
ples, if one of the frequency indices is modulated; b) comparing 
each modulated frequency index found in step a) with that index 
selected for modulation in the respective frequency band of the 
synchronization block; c) determining that the one block is the 
synchronization block if the majority of the comparisons made in 
step b) result in a match, and otherwise repeating steps a) 
through b) ; d) determining, in each of the frequency bands of 
one of the data blocks received subsequent to the synchronization 
block, if a respective one of the frequency indices is modulated; 
and, e) comparing the respective modulated frequency indices 
found in step d) with ones of a plurality of predetermined index 
patterns, each of the index patterns uniquely associated with a 
respective code bit, and reading the code bit only if the major- 
ity of modulated indices match the predetermined index pattern. 
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According to a further aspect of the present invention, 
a system for adding an inaudible code to a tone -like audio 
portion of a composite signal having two or more portions com- 
prises a sampling apparatus, a processor, a frequency transforma- 

5 tion, an encoder, a signal analyzer, and an encoder suspender. 
The sampling apparatus is arranged to sample audio at a sampling 
rate and to generate therefrom a plurality of short blocks of 

C3 sampled audio, where each of the short blocks has a duration less 

in than a minimum audibly perceptible signal delay. The processor 
iW is arranged to combine the plurality of short blocks into a long 

^'^0 block having a predetermined minimum duration. The frequency 
transformation is arranged to transform the long block into a 
frequency domain signal comprising a plurality of independently 

^2 modulatable frequency indices located in a plurality of frequency 

6 bands. The encoder is arranged to modulate two or more of the 
indices in each of the frequency bands so as to make a respective 
selected one of the indices an extremum while keeping a total 
acoustic energy of the audio constant. The signal analyzer is 
arranged to determine if the tone-like audio portion has a tone- 

20 like character within any one of the predetermined number of 

neighborhoods. The encoder suspender is arranged to suspend the 
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encoding of the encoder within any neighborhood in which the 
tone-like audio portion has a tone-like character. 

According to yet a further aspect of the present 
invention, a method is provided to add an inaudible code to at 
least one of a predetermined number of frequency neighborhoods 
within a tone-like audio portion of a composite signal having one 
or more additional portions. The method comprises the steps of: 
a) sampling the audio portion and generating from the sampled 
signal a plurality of short blocks, each of the short blocks 
having a duration less than a minimum audibly perceptible signal 
delay; b) combining the plurality of short blocks into a long 
block having a predetermined minimum duration; c) transforming 
the long block into a frequency domain signal comprising a 
plurality of independently modulatable frequency indices; d) 
identifying those neighborhoods, if any, of the predetermined 
number of frequency neighborhoods in which the tone-like audio 
portion has a tone-like character; and, e) modulating a respec- 
tive index in each neighborhood not identified in step d) so as 
to make a selected index in such neighborhood an extremum while 
keeping the total acoustic energy of the audio portion constant, 
and not modulating an index in any of those neighborhoods identi- 
fied in step d) , 

-13- 
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According to still a further aspect of the present 
invention, a broadcast audience measurement system, in which an 
inaudible code added to an audio signal is read by a decoding 
apparatus located within a statistically sampled dwelling, 
comprises an encoder, a receiver, and a decoder. The encoder is 
arranged to add a predetermined code bit to each of a predeter- 
mined number of odd frequency bands within a bandwidth of the 
audio signal. The receiver is within the dwelling and is ar- 
ranged to receive the encoded audio portion. The decoder has an 
input from the receiver, and the decoder is arranged to acquire a 
respective test value of the code bit from each of the frequency 
bands, to compare the test values, to determine that one of the 
test values is the code bit only if that test value is acquired 
from a majority of the frequency bands, and to otherwise deter- 
mine that no code bit has been read. 

According to another aspect of the present invention, a 
broadcast audience measurement system, in which an inaudible code 
added to an audio signal is read within a statistically sampled 
dwelling unit, comprises an encoding apparatus, a receiver, and a 
decoder. The encoding apparatus is arranged to add a code bit to 
a sampled long block of the audio signal, where the long block 
comprises a predetermined number of short blocks. Each of the 
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short blocks has a predetermined duration that is selected to be 
short enough not to be perceptible to a member of a broadcast 
audience. The encoding apparatus is further arranged to modulate 
a selected frequency index in each of a plurality of frequency 
5 neighborhoods so as to make each selected index an extremum in 

the respective neighborhood thereof while keeping a total energy 
of the audio signal constant. The receiver is within the dwell- 
LTJ ing, and is arranged to acquire the encoded audio signal. The 
lS decoder is arranged to read the code from the audio signal. The 
iW decoder has an input from the receiver, and the decoder comprises 
CO a buffer memory arranged to store one of the short blocks. The 
■'^ buffer memory is not arranged to store a long block. 

According to still aspect of the present invention, a 
method of encoding an audio signal comprises the following steps: 
W a) generating a plurality of short blocks from the audio signal, 
wherein each of the short blocks has a duration less than a 
minimum audibly perceivable signal delay; b) combining the 
plurality of short blocks into a long block; c) transforming the 
long block into a spectrum comprising a plurality of independ- 
20 ently modulatable frequency indices; and, d) modulating at least 
two of the indices so as to make one of the indices an extremum 
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while keeping the total energy of a neighborhood of the modulated 

indices substantially constant. 

According to yet aspect of the present invention, a 

method of reading a code element from an audio signal comprises 
5 the following steps: a) transforming at least a portion of the 

audio signal into spectral data spanning a predetermined number 

of frequency bands having a plurality of frequency neighborhoods; 
Q b) determining, for each of the neighborhoods, if one of the 
U1 frequency indices is modulated; and, c) assigning a transmitted 
iW code value to the code element if, in a majority of the neighbor- 
C9 hoods, the respective modulated frequency index is an index 
- selected for inclusion in the audio signal. 

Brief Description of the Drawing 
Q These and other features and advantages will become 

15 more apparent from a detailed consideration of the invention when 
taken in conjunction with the drawings in which: 

Figure 1 is a schematic depiction of a broadcast 
audience measurement system employing a program identifying code 
added to the audio portion of a composite television signal; 
20 Figure 2 is a flow chart depicting an encoding process 

of the present invention; and, 
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Figure 3 is a flow chart depicting a decoding process 
of the present invention. 

Detailed Description of the Inv ention 

Audio signals are usually digitized at sampling rates 
5 that range between thirty-two kHz and forty-eight kHz. For 

example, a sampling rate of 44.1 kHz is commonly used during the 
a digital recording of music. However, digital television ("DTV") 
Lu is likely to use a forty eight kHz sampling rate. Besides the 
ly sampling rate, another parameter of interest in digitizing an 
W audio signal is the number of binary bits used to represent the 
audio signal at each of the instants when it is sampled. This 
number of binary bits can vary, for example, between sixteen and 
^ twenty four bits per sample. The amplitude dynamic range result- 
^=3 ing from using sixteen bits per sample of the audio signal is 
15 ninety-six dB . This decibel measure is the ratio of the square 

of the highest audio amplitude (2^^ - 65536) to the square of the 
lowest audio amplitude (1^ 1) . The dynamic range resulting 
from using twenty- four bits per sample is 144 dB. Raw audio, 
which is sampled at the 44.1 kHz rate and which is converted to a 
20 sixteen-bit per sample representation, results in a data rate of 
705.6 kbits/s. 
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Compression of audio signals is performed in order to 
reduce this data rate to a level which makes it possible to 
transmit a stereo pair of such data on a channel with a through- 
put as low as 192 kbits/s. Audio compression is typically 
5 accomplished by transform coding. A block of audio consisting of 
samples, for example, may be decomposed, by application of a Fast 
Fourier Transform or other similar frequency analysis process, 
C3 into a spectral representation. In order to prevent errors that 
Ij] may occur at the boundary between one block of audio and the 
lOJ previous or subsequent block of audio, overlapping blocks of 
i;y audio are commonly used to produce the samples. In one such 
i= arrangement where 1024 samples per overlapped block are used, a 

block includes 512 "old" audio samples (i.e., audio samples from 
in a previous block) and 512 "new" or current audio samples. The 
W spectral representation of such a block is divided into critical 
bands, where each band comprises a group of several neighboring 
frequencies. The power in each of these bands can be calculated 
by summing the squares of the amplitudes of the frequency compo- 
nents within the band. 
20 Audio compression is based on the following principle 

of masking: in the presence of high spectral energy at one 
frequency (i.e., the masking frequency), the human ear is unable 
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to perceive a lower energy signal if the lower energy signal has 
a frequency (i.e., the masked frequency) near that of the higher 
energy signal. The lower energy signal at the masked frequency 
is called a masked signal. A masking threshold, which represents 
5 either (i) the acoustic energy required at the masked frequency 
in order to make it audible or (ii) an energy change in the 
existing spectral value that would be perceptible, can be dynami- 
n cally computed for each band. The frequency components in a 
in masked band can be represented in a coarse fashion by using fewer 
mi bits based on this masking threshold. That is, the masking 
m thresholds and the amplitudes of the frequency components in each 
- band are coded with a smaller number of bits that constitute the 
compressed audio. Decompression reconstructs the original signal 
based on these data, 
li^ It may be noted that the masking threshold depends to 

some extent on the nature of the sound being masked. Tone -like 
sounds, in which only one, or a few, frequencies are present in 
the acoustic spectrum, present special masking problems that are 
not encountered when dealing with a broad-band acoustic signal. 
20 Thus, a signal, that would be masked if added to a passage of 

speech, might be audible to a listener if added to a passage of 
music having the same acoustic energy. 
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A television audience measurement system 10 shown in 
Figure 1 is an example of a system in which the present invention 
may be used. The television audience measurement system 10 
includes an encoder 12 that adds an ancillary code to an audio 
signal portion 14 of a broadcast program signal. Alternatively, 
the encoder 12 may be provided, as is known in the art, at some 
other location in the program signal distribution chain. A 
transmitter 16 transmits the encoded audio signal portion along 
with a video signal portion 18 of the program signal. 

When the encoded signal is received by a receiver 2 0 
located at a statistically selected metering site 22, the audio 
signal portion of the received program signal is processed to 
recover the ancillary code, even though the presence of that 
ancillary code is imperceptible to a listener when the encoded 
audio signal portion is supplied to speakers 24 of the receiver 
20. To this end, a decoder 26 is connected either directly to an 
audio output 28 available at the receiver 20 or to a microphone 
3 0 placed in the vicinity of the speakers 24 through which the 
audio is reproduced. The received audio signal can be either in 
a monaural or stereo format . 

As disclosed in the '397 application and in the M25 
application, audio blocks may comprise 512 samples of an audio 
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stream sampled at a 48 kHz sampling rate. The time duration of 
such a block is 10.6 ms . Because two blocks are buffered, this 
arrangement comprises a total delay of about 22 ms, which would 
be perceptible to a viewer as a loss of synchronization between 
the video and audio signals. To avoid losing synchronization, a 
compensating delay is introduced into the video signal. Because 
it is preferable to do without such compensating delay, the 
encoder 12 implements encoding as represented by the flow chart 
of Figure 2 in order to avoid loss of video/audio synchronization 
while at the same time avoiding the use of a compensation delay 
circuit . 

The encoding implemented by the encoder 12 reduces the 
audio encoding delay to an imperceptible 5.3 milliseconds by 
structuring a complete, or ^^long" , code block as a sequence of 
overlapping short blocks that can be processed in a pairwise 
fashion with correspondingly smaller buffers and that are only Yi 
as long as the blocks used in the ^397 and M25 applications. 

According to the '397 application and the M25 applica- 
tion, a spectral analysis of a sampled interval of the audio 
signal that is long enough to form a block of 512 samples col- 
lected at a sampling rate of 48 kHz yields frequency "lines" 
separated from one another by 93.75 Hz. In these applications, a 
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neighborhood is a set of five consecutive frequency lines cover- 
ing a neighborhood bandwidth of 468.75 Hz that lies within a 
selected portion of the overall bandwidth of the audio portion 
being encoded. A binary data bit, either a '0* or is encoded 

by changing (preferably by boosting) the amplitude of one of the 
frequencies in the neighborhood such that it becomes a local 
extremum (i.e., a maximum in the preferred case, although the 
local extremum could alternatively a minimum) . Another frequency 
in the same neighborhood is changed in the alternate sense (i.e., 
preferably attenuated) in order to maintain the overall energy 
within the band at a constant level, a practice that is referred 
to herein as ''energy exchange encoding" . It has been found that 
the 468.75 Hz neighborhood bandwidth required for a code block is 
great enough that codes may be subject to interference effects 
when two frequencies in a single neighborhood undergo different 
amounts of change. 

In a preferred system of the present invention, a much 
longer "long block" sampling interval (8192 samples taken at 48 
kHz) is used. This longer sampling interval reduces the spacing 
between spectral lines to 5.85 Hz. As will be described in 
greater detail hereinafter, this preferred system writes an 
energy-exchange code bit in a frequency neighborhood containing 
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eight adjacent frequency indices. Thus, this frequency neighbor- 
hood requires a bandwidth of less than 50 Hz. This selection of 
sampling rate, number of samples in a sampling interval, and 
number of frequency indices in a neighborhood leads to a very 
small frequency difference in a neighborhood and thereby offers 
an interference-resistant code having a high degree of invulnera- 
bility to narrow-band interference effects. 

ENCODING BY SPECTRAL MODULATION 
At a step 4 0 of the encoding implemented by the encoder 
12 and shown in Figure 2, an In Buffer having 256 memory loca- 
tions is initialized by setting all of its memory locations to 
zero. Also, an Out Buffer having 128 memory locations is ini- 
tialized by setting all of its memory locations to zero. More- 
over, a sub-block counter and a long-block counter are both set 
to zero. At a step 41, data is shifted from the second half of 
the In Buffer to its first half, and data is copied from the 
second half of a Temporary Buffer to the first half of the Out 
Buffer . 

A short block is constructed at a step 42 by reading 
128 samples of new data from the audio signal portion 14 into the 
second half of the In Buffer which combines these 128 new samples 
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with the last 128 samples of a previous block stored in the first 
half of the In Buffer as a result of the step 41. In order for 
the encoder 12 to embed a digital code in an audio data stream in 
a manner compatible with compression technology, the encoder 12 
should preferably use frequencies and critical bands that match 
those used in compression. The short block length Ng of the 
audio signal that is used for coding may be chosen such that, for 
example, Ng = N^/ j , where j is an integer, and where is the 
length in samples of a long block. A suitable value for is 
256, for example, and a suitable value for % is 8192, for 
example. The short block itself is constructed from the last 128 
samples of a previous block and the 12 8 samples of new data read 
at the step 42 of Figure 2. The samples may be derived from the 
audio signal portion 14 by the encoder 12 such as by use of an 
analog to digital converter. 

The amplitude of the audio signal within a short block 
may be represented by the time-domain function v(n), where n is 
the sample index. The time-domain function v(n) is converted to 
a time value by multiplication by the sample interval at a step 
43. To this end, a ^Vindow function" is defined according to the 
following equation : 
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1 - cos( ) 



^(n) = — (1) 



and is applied to v(n) at the step 43 by multiplication to obtain 
a windowed signal v(n)w(n) which is stored in the Temporary 
Buffer. At a step 44, a Discrete Fourier Transform F (u) of 

5 v(n)w(n), where u is a frequency index, is computed. This 

Discrete Fourier Transform can be performed using the well-known 

Q Fast Fourier Transform (FFT) algorithm. 

iJl The frequencies resulting from the Fourier Transform 

UJ are indexed in the range -12 7 to +127, where an index of 12 7 
m corresponds to exactly half the sampling frequency fg. There- 
- fore, for a forty-eight kHz sampling frequency, the highest index 
would correspond to a frequency of twenty-four kHz, Accordingly, 
ul for purposes of this indexing, the index closest to a particular 
O frequency component fj, where frequency is measured in kHz, 
15 resulting from the Fourier Transform is given by the following 
equation : 



J 



(2) 



24 



where equation (2) is used in the following discussion to relate 
a frequency f j to its corresponding short -block index j . As 
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noted above, in the preferred coding arrangement, sequential 
indices calculated for a short block are separated from each 
other by a frequency of 187.5 Hz. Correspondingly, in consider- 
ing a long block made up of 64 sub-blocks of 12 8 samples each 
5 (where the sub-blocks are processed in pairs having 256 samples) , 
an equation relating the long block index J to a high resolution 
spectral frequency fj in kHz is given by the following: 

U 4096/; ^ , 

m 1^ (3) 

m From equations (2) and (3), it is clear that J = 32j for frequen- 
1€ cies which are common to both the high (long block) and low 

(short block) resolution spectra. 
tn In the preferred high resolution encoding arrangement 

C3 of the present invention, five frequency bands are selected for 
use in a ^^voting'^ arrangement to be discussed in greater detail 
15 hereinafter. For each of the selected frequency bands, a high 
resolution neighborhood of eight long block indices Jl = Js " 
Jg - 3, Js - 2, Jg - 1/ Jsf Js + 1/ Jg + 2, Jg + 3 is defined 
about a central short block index jg with Jg = 32 jg. In one such 
embodiment, the selected frequencies and indices are shown in the 
20 following table: 
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Band Index 


Short Block Cen- 
tral index 


Long Block Cen- 
tral Index 


Long Block Range 


0 


7 


224 


220-227 

(1287 Hz-1328 Hz) 


1 


11 


352 


348-355 

(2035 Hz-2077 Hz) 


2 


15 


480 


476-483 

(2785 Hz-2826 Hz) 


3 


19 


608 


604-611 

(3533 Hz-3574 Hz) 


4 


23 


736 


732-739 

(4282 Hz-4323 Hz) 



It may be noted that each long block in the arrangement 
shown in the above exemplary table is set up to define neighbor- 
hoods having eight long block indices. It will be recognized 
that different numbers of indices could be used. Adding indices 
has the effect of increasing the numerical range that can be 
accommodated in a single block, but it also has the effect of 
increasing the frequency span of a block, thereby rendering the 
code more susceptible to interference effects. 
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Let it be assumed that a long block L consists of 8192 
samples made up of 64 sub-blocks, with each sub-block having 128 
new samples. A 256 -sample short block is constructed from 
adjacent sub-blocks by the use of the window function of equation 
(1) . Thus, L consists of a sequence of sixty four overlapped 
short blocks, each of which has 256 samples. These short blocks 
may conveniently by indexed as S^, where the short block index i 
ranges from 0 to 63 . 

A masking analysis of the sort conventionally used in 
compression algorithms is preferably applied at the step 44 to 
the short blocks in order to determine the maximum change in 
energy or in the masking energy level that can occur at any 
critical frequency band without making the modulation perceptible 
to a listener. These critical frequency bands, determined by 
experimental studies carried out on human auditory perception, 
may vary in width from single frequency bands at the low end of 
the spectrum to bands containing ten or more adjacent frequencies 
at the upper end of the audible spectrum. In the psycho-acoustic 
modeling scheme used in the MPEG-AAC audio compression standard 
ISO/IEC 13818-7:1997, for example, critical band eighteen in- 
cludes two frequencies with indexes 19 and 2 0 of a short audio 
block. The acoustic energy in each critical band influences the 
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masking energy of its neighbors. Algorithms for computing the 
masking effect are described in the standards document such as 
ISO/IEC 13818-7:1997. These analyses may be used to determine 
for each audio block the masking contribution due to ^^tonality" 
as well as "noise" like features of the audio spectrum. The 
tonality index computed by these algorithms at the step 44 
provides a useful tool for determining circumstances under which 
a sub-block may produce audible degradation when encoded. The 
analysis can also be used to determine, on a per critical band 
basis, the amplitude of a time domain code signal that can be 
added without producing any noticeable audio degradation. Thus, 
for a short block frequency index j , belonging to a critical band 
with masking energy Ej , the maximum amplitude of a code signal is 
given by the following equation: 



where 128 is a factor required to convert from a spectral domain 
to the time domain. 



block indices that are very near to the central index of the 
corresponding short block for a selected band. For example, if a 




(4) 



A preferred code waveform is constructed using long 
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sub-block with a sub-block index m and a coding band b is 
considered, and if a spectral frequency having a long block index 
of is enhanced, an appropriate code waveform will have 256 
samples, which can be denoted as C^ip) , where the index p runs 
from 0 to 255. In a preferred embodiment, each of these compo- 
nents is selected to follow the relationship: 

C,(p) - ^,cos((t)^ ^ ^) ^ k.A.cosin ^ (}), ^ (5) 



UJ where A^, is a nominal code amplitude level, is an index in the 
CO long block frequency space, jb is the central index of the 
10 corresponding short block, is given by the following equation: 



271/, ml 28 , , 

(b = ^ (6: 

8192 



4)^ is the starting phase angle for sub-block m, and (J)j is the 
phase angle of the short block frequency index obtained from 
the Fourier Transform analysis. The quantity ensures that the 
15 code component having a frequency index of is in phase in all 
64 blocks constituting the long block. It may be noted that, in 
order to simplify the representation, a multiplication of the 
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code signal with a window function (not shown) may be implemented. 

The above choice for a code waveform provides an energy 
exchange coding feature. For a given large block index Jj^, the 
first cosine term in equation (5) represents an added energy. 
5 The corresponding short block index term, because of the 

change in phase angle of subtracts a compensating amount of 
energy with the assumption that the spectral energy at repre- 
Q sents the overall energy in the coding band b and includes all of 
in the high resolution coding frequencies in the band, 
ly It should be noted that each high resolution frequency 

Ca component, such as Jj,, influences not only the spectral amplitude 

at % but also its neighbors. The most significant impact is on 
=:J the immediate neighbors - 1 and + 1- The constant k^ with 

a value in the range 0 to 0 . 8 is used to control the extent to 
W which a single index j^, compensates for the code signal. 

The window function applied at the step 43 causes 
further interaction among the short block frequency indexes. 
Because the high resolution frequencies are close to each other, 
these amplitude changes are not perceptible. Because of the 
20 encoding operation, the desired long block frequency with index 
is enhanced relative to its neighbors in band. For example, 
if a long block index of 223 is selected, where the corresponding 
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short block central index is seven, and the code energy for all 
64 blocks is calculated, a component with frequency index 223 has 
a higher energy level than the other indices in the neighborhood 
from 220 to 227. 

5 The nominal code amplitude level is chosen such that 

it is the lowest value that permits successful extraction of the 
embedded code during decoding. For most sub-blocks, the nominal 

;:;] code amplitude level A^ is expected to be well below the corre- 

m spending masking amplitude level Mj . However, in cases where Mj 
IQJ is not greater than A^, Mj replaces in equation (5) . 

m In preferred embodiments of the encoding system of the 

present invention, signal analyzers or signal analyzing algo- 
rithms are used to examine each encodable neighborhood of each 
short block to see if the signal being encoded has a tone-like 

W character within that neighborhood. The tonality index calcu- 
lated at the step 44 by the masking algorithm described in 
ISO/IEC 13818-7:1997, for example, provides such a measure. A 
purely tonal audio block is expected to have a tonality index of 
1.0, whereas a "noise-like" block has a tonality index close to 

20 0. If the tonality index for the bands used in coding has a 
value exceeding a tonal threshold, the encoding operation is 
suspended for that sub-block. (See the discussion below regard- 
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ing step 46.) It is noted that, even if several sub-blocks are 
tonal, coded data can still be successfully retrieved because 
there are 64 sub-blocks in each long block. It is the spectrum 
of the long block that is analyzed during decoding. 

A preferred encoding arrangement of the invention uses 
a redundant transmission scheme to make the system more robust. 
As depicted in the table shown above, five different frequency 
bands are defined in the exemplary system. The coding arrange- 
ment disclosed above was described with respect to only one of 
these bands. That is, the five bands are essentially independent 
of each other so that a code symbol can be sent in multiple bands 
at any given time in the interest of providing redundant trans- 
mission . 

One of the advantages of the encoding method described 
above is that the processing uses only 256 samples at each stage, 
of which 12 8 are new samples and 12 8 are carried over from the 
prior processing step. Thus, at a selected sampling rate of 48 
kHz, the total buffer capacity required to hold the samples in a 
"double buffer" is 256 and the corresponding time duration is 
256/48000 = 5.3 milliseconds. As is known to those skilled in 
the arts of perceptual psychology, a loss of synchronization of 
less than about 10 msec between two portions (e.g., left and 
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right stereo channel) of a composite audio signal or between an 
audio and a video portion of a composite television signal is not 
perceptible. Thus, the encoding method of the present invention 
does not require introducing a compensating delay in another 
portion of the signal. When used for television audience re- 
search purposes, the present system has the advantage that it can 
be used without a video delay circuit and without disturbing the 
viewer with a perceptible loss of synchronization. 

In order to design a practical encoding scheme, it is 
essential to develop a synchronization method that will allow the 
decoding system to determine the start of a new message. As is 
often done in encoded messaging systems, a preferred system of 
the invention defines a synchronization block having a unique 
structure that differentiates it from other encoded blocks. At a 
step 45, therefore, a synchronization block consisting of 8192 
samples is selected when the long block counter has a count of 
zero such that the synchronization block has the following 
characteristics: in Band 0, index 220, which is the first 
frequency line in that neighborhood, is enhanced; in Band 1, the 
second frequency line, index 349, is enhanced; in Band 2, the 
third frequency line, index 4 78, is enhanced; in Band 3, the 
fourth frequency line, index 607, is enhanced; and, in Band 4, 
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the fifth frequency line, index 73 6, is enhanced. When the 
decoder analyzes a long block by comparing each enhanced fre- 
quency index with the respective index selected for enhancement 
in a synchronization block and finds a match in at least three of 
the five frequency bands, the system determines that a potential 
synchronization block has been detected, and interprets the long 
blocks following a synchronization block as the actual message 
data . 

As noted above, in discussing the blocks selected for 
an exemplary system and shown in the above table, each long block 
comprises a set of eight indices that can be modulated to form a 
code. In a television audience measurement application of 
interest to the inventor, a complete encoded message may comprise 
forty-eight bits consisting of a sixteen bit Station Identifier 
(SID) and a thirty-two bit time stamp (TS) . To match this 
message to the selected set of indices, the forty-eight bits of 
data may be grouped into sixteen three-bit sets. The decimal 
value of each of these three-bit sets can range from zero to 
seven so that each of the three-bit sets can be encoded by using 
the selected long blocks. In one preferred arrangement, the 
system encodes a value of k (where k is in the range of zero to 
seven) by modulating the k^^ available index. In this arrange- 
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ment, for example, to send a code group having a value = five, 
the 6^^ index in each band (i.e., indices 225, 353, 481, 609, and 
737) is selected at the step 45 for enhancement. In this embodi- 
ment, a forty-eight bit data packet can be transmitted as one 
5 long synchronization block followed by sixteen long data blocks. 
For the choice of code blocks and sampling frequency disclosed 
above, sending these seventeen long blocks requires 2.89 seconds. 
Q This arrangement provides a clear distinction from the synchroni- 
ifl zation block, which has a different index enhanced in each band. 
101 More generally speaking, each of a plurality of possi- 

ble code bits has an index pattern uniquely associated with it, 
- and decoding a bit comprises comparing each of plurality of 

enhanced indices with ones of the index patterns to determine if 
a majority of the enhanced indices match with one of the prede- 
termined patterns. The exemplary embodiment recited above is 
both conceptually straightforward and robust, but may lead to an 
audible beat phenomenon because each code frequency is separated 
from its central short block frequency by the same value in all 
the coding bands. In the case of a code bit of value five, this 
20 constant difference frequency is 5.85 Hz, which corresponds to 
an index difference of one. In another preferred embodiment, 
this problem is overcome at the step 4 5 by choosing as the index 
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pattern a pre-determined pseudo- random combination of frequency 
indexes for each band. Thus, for example, a value of five could 
be coded by using the following frequency indexes in the five 
bands: 225, 355, 476, 607, and 737. The beat phenomenon is 
5 substantially decreased by this change. 

This arrangement of sending the same data in each of 
five bands at the same time fits well with the masking algorithms 
Q discussed above. That is, one can select a masking algorithm 
\Jl that suspends coding in one or more of the bands, but that 
IQJ continues to encode in the other ones of the bands, 
[fl Once the frequencies have been selected at the step 45, 

r the signal at these frequencies is enhanced at the step 46 
2 assuming that the masking level and the tonality as indicated by 
m the tonality index are acceptable. The samples v(n)w(n) stored 
Ijg in the Temporary Buffer are modified according to equations (5) 
and (6) and, at a step 47, the code signal is added to the 
Temporary Buffer. At a step 48, the first half of the Temporary 
Buffer is added to the Out Buffer, and the 128 samples in the Out 
Buffer are passed to the transmitter 16 as encoded data. 
20 At a step 49, the sub-block counter is incremented by 

one and, if the sub-block counter is equal to 64, the long block 
counter is incremented by one. No other sub-blocks are encoded 
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until the long block counter is incremented. When the long block 
counter is equal to 17, then a complete code message (a synchro- 
nization block and sixteen data blocks) has been passed to the 
transmitter 16 and the long block counter is reset to zero to 
begin encoding a new message. If the sub-block counter is not 
equal to 64, or after the long block counter has been reset to 
zero, program flow returns to the block 41. 

DECODING THE SPECTRALLY MODULATED SIGNAL 
A preferred system provides an audio signal acquisition 
arrangement at a receiving location. This location, for example, 
may be within the statistically selected metering site 22 . In 
some instances, the embedded digital code can be recovered from 
the audio signal available at the audio output 28 of the receiver 
20. When such an output is available, it provides a relatively 
high quality signal source. However, many receivers 20 do not 
have the audio output 28, which constrains the audience research 
system operator to acquire an analog audio signal with the 
microphone 3 0 placed in the vicinity of the speakers 24. Because 
audience measurement systems generally have a goal of minimizing 
the intrusion that they make into the measured television viewing 
environment, the microphone 30 is preferably placed behind the 
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receiver 20, where the quality of the signal it acquires is 
degraded from what would be found if the microphone 3 0 were 
placed in front of the receiver 20. This signal degradation has 
led to the failure of many prior art systems that attempted to 

5 read a buried code from an audio signal picked up with a micro- 
phone. However, the redundancy obtained by encoding five fre- 
quency bands as discussed above increases the likelihood that the 

ri code can be successfully recovered. 

i"l In the case where the microphone 3 0 is used, or in the 

IQj case where the signal on the audio output 2 8 is analog, the 
m decoder 2 6 converts the analog audio to a sampled digital output 
r stream at a preferred sampling rate matching the sampling rate of 
:li the encoder 12. In decoding systems where there are limitations 
U\ in terms of memory and computing power, a half-rate sampling 
1® could be used. In the case of half -rate sampling, each short 

block would consist of Nq/2 = 128 samples, and the resolution in 
the frequency domain (i.e., the frequency difference between 
successive spectral components) would remain the same as in the 
full sampling rate case. In the case where the receiver 20 
20 provides digital outputs, the digital outputs are processed 

directly by the decoder 2 6 without sampling but at a data rate 
suitable for the decoder 26. 
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In a practical implementation of audio decoding, such 
as may be used in a home audience metering system, the ability to 
decode an audio stream in real-time is highly desirable. It is 
also highly desirable to transmit the decoded data to a remote 
central office. The decoder 26 may be arranged to run the 
decoding algorithm described below in connection with Figure 3 on 
Digital Signal Processing (DSP) based hardware of the sort 
typically used in such applications. As disclosed above, the 
incoming encoded audio signal may be made available to the 
decoder 2 6 from either the audio output 2 8 or from the microphone 
3 0 placed in the vicinity of the speakers 24. 

As shown by step 50 in the flow chart of Figure 3, a 
circular buffer capable of storing 4096 samples is initialized by 
setting all of its storage locations to zero. Also, a set of 
frequency bins are set to zero. At a block 51, 256 samples are 
read into an audio buffer. Also, a block sample counter is set 
to zero. Before recovering the actual data bits representing 
code information, it is necessary to locate the synchronization 
block which is preferably encoded by enhancing (or diminishing) 
the amplitude of a unique set of frequencies. In one preferred 
embodiment these frequencies have indexes 220, 349, 478, 607, and 
73 6 and each one is in a different coding band. In order to 
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search for the synchronization block, as well as to extract data 
from subsequent blocks within an incoming audio stream, the 
circular buffer is used. The circular buffer has a sufficient 
size to store 4096 samples in the case of half rate sampling. 

5 This arrangement is essential in order to implement a near real- 
time decoding scheme based on a sliding FPT routine which forms 
part of the decoding algorithm shown in the flow chart of Figure 

Q 3 . 

m Let it be assumed that, for the audio buffer currently 

laj stored in the circular buffer, there are a spectral amplitude 
rg BqLJ] and a phase angle ct>o [ J] at a frequency with index J. The 
spectral amplitude Bq [ J] and the phase angle <Po[J] represent the 
spectral values for the 4096 audio samples currently in the 
tfl circular buffer. If two new time domain samples V4094 and V4095 
iS are read from the audio buffer and are inserted into the circular 
buffer as indicated by a step 52 so as to replace the two earli- 
est samples and V;^ in the circular buffer, then the new spec- 
tral amplitude B^[J] and phase angle ^^[J] for each of the indi- 
ces J are determined at a step 53 in accordance with the follow- 
20 ing equation: 
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Thus, the spectrum of the circular buffer can be computed merely 
by updating the existing spectrum for the samples contained in 
the circular buffer according to equation (7) . Even when all the 
spectral values - amplitude and phase - are initially set to 0 at 
the step 50, as new data enters the circular buffer, and as old 
data gets discarded, the spectral values gradually change until 
they correspond to the actual FFT spectral values for the data 
currently in the circular buffer. In order to overcome certain 
instabilities that may arise during computation, multiplication 
of the incoming audio samples by a stability factor (usually set 
to 0.99995) and multiplication of the discarded samples by a 
factor 0.99995^^^^ = 0.902666 is known to most practitioners in 
this field. The sliding FFT algorithm provides a computationally 
efficient means of ' calculating the spectral components of inter- 
est for the 4095 samples preceding the current sample location 
and the current sample itself. The frequency bins are updated at 
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the block 53 with the results of the analysis performed according 
to equation (7) 

If the block sample counter has a count which is a 
multiple of 64, the frequency bins are analyzed and the results 
5 of the analysis are stored in a Status Information Structure 

(SIS) as indicated in step 54 of Figure 3. This value 64 may be 
used because the frequency spectrum of a long block of 4 096 
samples changes very little over a small number of samples of an 
Jn audio stream. Even though the sliding FFT algorithm is used to 
IQj update the spectral values in two sample increments, the analysis 
i:o of the spectrum to locate the synchronization block and to 
r extract data needs to be performed only every 64 samples. Thus, 

4096/64 = 64 SIS structures are used to track the intermediate 
m results of the decoding operation. These SIS structures are 
lis indexed as SISq. SIS^, . . . SIS^^, Each SIS structure is updated 
at 4096 sample intervals, which corresponds to the length of a 
long block in the half -sampling rate case. Each SIS structure 
contains a synchronization flag and a data storage location. 
Also, the SIS includes a counter. 
20 The search for the synchronization block is the first 

step in the decoding process. Let us assume that at a sample 
location where the SIS SIS^^ needs to be updated because a spec- 
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trum, which satisfies the characteristics of a synchronization 
block, is found. In such a spectrum, indexes 220, 349, 478, 607, 
73 6 are enhanced and possess higher spectral power than their 
neighbors in the respective bands. Due to factors such as audio 
5 compression, audio degradation due to amplifier-speaker-micro- 
phone non-linearities, or ambient noise in the case of microphone 
based decoding systems, it is possible that not all the five 
n bands have the desired characteristics. The redundant transmis- 
\n sion feature described above enables detection of a long block as 
IQj being a synchronization block even if only three of the five 
m bands satisfy the criteria for a synchronization block. Once a 
T synchronization block has been detected, a synchronization flag 
.^E within the corresponding SIS structure is set to one. In a 
iin practical implementation, more than one SIS structure can have 
liS its synchronization flag set to one. Usually several adjacent 
SIS structures, for example, SIS]^_2; SISj^.^, SIS]^, SISj^^^/ ^IS^^s' 
may all have synchronization flags set to one because the spec- 
trum of a long audio block does not change rapidly. 

When SISj^ is analyzed 4096 samples later, the algorithm 
20 recognizes the synchronization flag and attempts to extract the 

first three-bit data value encoded in the spectrum. This extrac- 
tion may be done by means of a voting algorithm that compares 
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test values taken from each of the neighborhoods and that accepts 
a test value as the data value if the same test value is found in 
three out of the five band neighborhoods. In addition, if a 
valid data value in the range zero to seven is extracted, the 
counter within the SIS is incremented to show that the first 
member of the sixteen member message data has been extracted. 
The extracted three-bit datum is also stored within the structure 
at a corresponding data storage location. In the event a valid 
datum is not found either at the current location or at any one 
of the fifteen subsequent locations where SISj, is updated, the 
SIS structure's synchronization flag is reset to zero and the 
counter is reset to zero. These actions frees the SIS to once 
again look for synchronization blocks. When an SIS structure's 
counter increments to sixteen, it contains a full message packet 
consisting of forty-eight bits that could be transmitted out, as 
indicated in step 55 of the flow chart in Figure 3. For example, 
the message packet may be transmitted to a Central Office. When 
this transmission is done, the synchronization flag is reset to 
zero and the counter is reset. 

At a block 56, the block sample counter is incremented 
by two corresponding to the two samples read from the audio 
buffer to the circular buffer at the step 52. If the block 
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sample counter does not have a count equal to 256, flow returns 
to the step 52 where two more samples from the audio buffer are 
read into the circular buffer. On the other hand, if the block 
sample counter does have a count equal to 256, flow returns to 
the step 51 where another 256 samples are inserted into the audio 
buffer . 

Although the present invention has been described with 
respect to several preferred embodiments, many modifications and 
alterations can be made without departing from the invention. 
Accordingly, it is intended that all such modifications and 
alterations be considered as within the spirit and scope of the 
invention as defined in the attached claims. 
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WHAT IS CLAIMED IS : 



1 1. A system for adding an interference-resistant, 

2 inaudible code to an audio signal comprising: 

3 a sampler arranged to sample the audio signal at a 

4 sampling rate and to generate therefrom a plurality of short 

5 blocks of sampled audio, each of the short blocks having a 

j^i duration less than a minimum audibly perceivable signal delay; 

a processor arranged to combine the plurality of short 

|1 blocks into a long block having a predetermined minimum duration; 

% a frequency transformation arranged to transform the 

10 long block into a frequency domain signal comprising a plurality 

It of independently modulatable frequency indices, wherein a fre- 

1:21 quency difference between two adjacent ones of the indices is 

liS determined by the minimum duration and the sampling rate; 

14 a frequency selector arranged to select a neighborhood 

15 of frequency indices so that the frequency difference between a 

16 lowest index and a highest index within the neighborhood is less 

17 than a predetermined value; and, 

18 an encoder arranged to modulate two or more of the 

19 indices in the neighborhood so as to make a selected one of the 
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20 indices an extremum while keeping the total energy of the neigh- 

21 borhood constant . 

1 2 . The system of claim 1 wherein the processor com- 

2 prises a digital computer having a buffer memory. 

1 3. The system of claim 1 wherein the frequency trans- 

2n formation comprises a Fast Fourier Transform algorithm. 

|1 4. The system of claim 1 wherein the encoder comprises 

S an algorithm that increases the energy of a selected index in the 

3 neighborhood and that decreases the energy of a short block 

1 associated therewith. 

5 . A method of adding a code to a frequency band of a 

2 sampled audio portion of a composite signal without thereby 

3 introducing a perceptible delay between the encoded audio portion 

4 and another portion of the composite signal, the method compris- 

5 ing the steps of : 

6 a) selecting a sampling rate and a frequency difference 

7 between adjacent ones of a predetermined number of frequency 

8 indices included in a frequency neighborhood; 
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9 b) determining from the sampling rate and from the 

10 frequency difference a duration of a block of samples; 

11 c) determining an integral number of sequential sub- 

12 blocks to make up the block, where the integral number is se- 

13 lected so that each of the sub-blocks has a sub-block duration 

14 less than the perceptible delay; and, 

15 d) processing the block so as to modulate a selected 
16-1 one of the frequency indices without changing a total signal 
ifi energy of the band. 

6. The method of claim 5 wherein the composite signal 

2 comprises a television broadcast signal and wherein the another 

1 portion of the composite signal comprises a video signal. 

ij5 7. The method of claim 5 wherein in step d) the 

2 processing comprises modulating two or more of the frequency 

3 indices within the neighborhood so as to make a selected one of 

4 the indices an extremum. 

1 8. Apparatus for reading a code from an audio signal, 

2 the code comprising a sequence of blocks having a predetermined 

3 number of samples of the audio signal, the code comprising a 
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4 synchronization block followed by a predetermined number of data 

5 blocks, the apparatus comprising: 

6 a buffer memory arranged to hold one of the blocks; 

7 a frequency transformation arranged to transform the 

8 one block into spectral data spanning a predetermined number of 

9 frequency bands, wherein each of the frequency bands comprises a 
10 respective neighborhood of frequency indices; 

Ij.^ a processor arranged to determine, for each of the 

ip neighborhoods, if a respective predetermined one of the frequency 

13:1 indices is modulated; and, 

iff, a vote determiner arranged to determine that the one 

if block is the synchronization block if, in a majority of the 

iS frequency bands, the respective modulated frequency index is a 

liiS respective index selected for inclusion in the synchronization 

iS block; 

19 wherein the processor is further arranged to determine 

20 if, in one of the data blocks received subsequent to the synchro- 

21 nization block, a respective predetermined one of the frequency 

22 indices is modulated; 

23 wherein the vote determiner is further arranged to 

24 determine if, in a majority of the frequency bands, the respec- 
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25 tive modulated frequency index is a respective index selected for 

26 inclusion in the one data block. 

1 9. The apparatus of claim 8 wherein the frequency 

2 transformation comprises a Fast Fourier Transform algorithm 

3 executed by a digital computer. 

10. The apparatus of claim 8 wherein the processor 
comprises a general purpose digital computer operating under 

% program control and having a plurality of algorithms stored in a 

l4i memory. 

)i 11. The apparatus of claim 8 wherein the vote deter- 

% miner comprises an algorithm executed by a digital computer. 

1 12 . A method of reading a code from an audio signal by 

2 sequentially transforming a sequence of blocks of audio samples 

3 into spectral data spanning a predetermined number of frequency 

4 bands, wherein each of the frequency bands comprises a predeter- 

5 mined number of frequency indices, wherein each of the blocks 

6 comprises a predetermined number of the samples, and wherein the 
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7 code comprises a synchronization block followed by a predeter- 

8 mined number of data blocks, the method comprising the steps of: 

9 a) determining, in each of the frequency bands of one 

10 of the blocks of audio samples, if one of the frequency indices 

11 is modulated; 

12 b) comparing each modulated frequency index found in 

13 step a) with that index selected for modulation in the respective 
IM frequency band of the synchronization block; 

l|j c) determining that the one block is the synchroniza- 

161 tion block if the majority of the comparisons made in step b) 

result in a match, and otherwise repeating steps a) through b) ; 

d) determining, in each of the frequency bands of one 

iS of the data blocks received subsequent to the synchronization 

2&i block, if a respective one of the frequency indices is modulated; 

2jfi and, 

22 e) comparing the respective modulated frequency indices 

23 found in step d) with ones of a plurality of predetermined index 

24 patterns, each of the index patterns uniquely associated with a 

25 respective code bit, and reading the code bit only if the major- 

26 ity of modulated indices match the predetermined index pattern. 
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1 13. The method of claim 12 wherein a value of k is 

2 read as the code bit in step e) if the k^^ index in each of the 

3 bands is modulated. 

1 14 . The method of claim 12 wherein the predetermined 

2 index pattern comprises a pseudo-random sequence. 

15. A system for adding an inaudible code to a tone- 
is like audio portion of a composite signal having two or more 

portions, the system comprising: 
g a sampling apparatus arranged to sample audio at a 

T sampling rate and to generate therefrom a plurality of short 

blocks of sampled audio, each of the short blocks having a 
S duration less than a minimum audibly perceptible signal delay; 

a processor arranged to combine the plurality of short 
9 blocks into a long block having a predetermined minimum duration; 

10 a frequency transformation arranged to transform the 

11 long block into a frequency domain signal comprising a plurality 

12 of independently modulatable frequency indices located in a 

13 plurality of frequency bands; 

14 an encoder arranged to modulate two or more of the 

15 indices in each of the frequency bands so as to make a respective 
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16 selected one of the indices an extremum while keeping a total 

17 acoustic energy of the audio constant; 

18 a signal analyzer arranged to determine if the tone- 

19 like audio portion has a tone-like character within any one of 

20 the predetermined number of neighborhoods; and, 

21 an encoder suspender arranged to suspend the encoding 

22 of the encoder within any neighborhood in which the tone-like 
23--, audio portion has a tone -like character. 

'll 16. The system of claim 15 wherein the audio signal is 

% part of a television broadcast signal. 

p 17. The system of claim 15 wherein the frequency 

1^ transformation comprises a Fast Fourier Transform algorithm. 

1 18. The system of claim 16 wherein the signal analyzer 

2 comprises a computer arranged to carry out a masking algorithm 

3 described in ISO/IEC 13818-7:1997. 

1 19. A method for adding an inaudible code to at least 

2 one of a predetermined number of frequency neighborhoods within a 



-54- 



Attorney Docket 
28049/36241 



3 tone- like audio portion of a composite signal having one or more 

4 additional portions, the method comprising the steps of: 

5 a) sampling the audio portion and generating from the 

6 sampled signal a plurality of short blocks, each of the short 

7 blocks having a duration less than a minimum audibly perceptible 

8 signal delay; 

9 b) combining the plurality of short blocks into a long 
block having a predetermined minimum duration; 

Ip c) transforming the long block into a frequency domain 

l|j signal comprising a plurality of independently modulatable 

13r] frequency indices; 

14' d) identifying those neighborhoods, if any, of the 

15; predetermined number of frequency neighborhoods in which the 

tone-like audio portion has a tone-like character; and, 
1:3 e) modulating a respective index in each neighborhood 

18 not identified in step d) so as to make a selected index in such 

19 neighborhood an extremum while keeping the total acoustic energy 

20 of the audio portion constant, and not modulating an index in any 

21 of those neighborhoods identified in step d) . 
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1 20. The method of claim 19 wherein the composite 

2 signal comprises a television broadcast signal and wherein one of 

3 the additional portions comprises a video signal. 

1 21. The method of claim 19 wherein step c) comprises 

2 the step of transforming the long block according to a Fast 

3 Fourier Transform. 

22. The method of claim 19 wherein step c) comprises a 
% sub-step of carrying out a masking algorithm described in ISO/lEC 

13818-7:1997. 

23. A broadcast audience measurement system in which 
an inaudible code added to an audio signal is read by a decoding 

ijj apparatus located within a statistically sampled dwelling, the 

4 system comprising: 

5 an encoder arranged to add a predetermined code bit to 

6 each of a predetermined number of odd frequency bands within a 

7 bandwidth of the audio signal; 

8 a receiver within the dwelling arranged to receive the 

9 encoded audio portion; and, 
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10 a decoder having an input from the receiver, the 

11 decoder arranged to acquire a respective test value of the code 

12 bit from each of the frequency bands, to compare the test values, 

13 to determine that one of the test values is the code bit only if 

14 that test value is acquired from a majority of the frequency 

15 bands, and to otherwise determine that no code bit has been read. 

24. The broadcast audience measurement system of claim 
2ri 23 wherein the audio signal is part of a television broadcast 
signal. 

f 25. The broadcast audience measurement system of claim 

|S 23 wherein the receiver includes a microphone. 

p. 26. The broadcast audience measurement system of claim 

2 23 wherein the receiver comprises an audio output jack. 

1 2 7. A broadcast audience measurement system in which 

2 an inaudible code added to an audio signal is read within a 

3 statistically sampled dwelling unit, the system comprising: 

4 an encoding apparatus arranged to add a code bit to a 

5 sampled long block of the audio signal, the long block comprising 
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6 a predetermined number of short blocks, each of the short blocks 

7 having a predetermined duration that is selected to be short 

8 enough not to be perceptible to a member of a broadcast audience, 

9 the encoding apparatus being further arranged to modulate a 

10 selected frequency index in each of a plurality of frequency 

11 neighborhoods so as to make each selected index an extremum in 

12 the respective neighborhood thereof while keeping a total energy 
13n of the audio signal constant; 

1^ a receiver within the dwelling, the receiver being 

l&l arranged to acquire the encoded audio signal; and, 

a decoder arranged to read the code from the audio 

iT signal, the decoder having an input from the receiver, the 

IS decoder comprising a buffer memory arranged to store one of the 

Ifh short blocks, the buffer memory being arranged to store a long 

M block. 

1 28. The broadcast audience system of claim 27 wherein 

2 the audio signal is part of a television signal. 

1 29. The broadcast audience system of claim 27 wherein 

2 the encoder comprises a frequency transformation arranged to 

3 transform the long block into a frequency domain signal. 
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1 30. The broadcast audience system of claim 27 wherein 

2 the receiver comprises a microphone. 

1 31. The broadcast audience system of claim 27 wherein 

2 the receiver comprises an audio output jack. 

1^. 32. A method of encoding an audio signal comprising 

jS the following steps: 

a) generating a plurality of short blocks from the 

J audio signal, wherein each of the short blocks has a duration 

^3 less than a minimum audibly perceivable signal delay; 
g b) combining the plurality of short blocks into a long 

fh block; 

c) transforming the long block into a spectrum compris- 

9 ing a plurality of independently modulatable frequency indices; 

10 and, 

11 d) modulating at least two of the indices so as to make 

12 one of the indices an extremum while keeping the total energy of 

13 a neighborhood of the modulated indices substantially constant. 
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1 33. A method of reading a code element from an audio 

2 signal comprising the following steps: 

3 a) transforming at least a portion of the audio signal 

4 into spectral data spanning a predetermined number of frequency 

5 bands having a plurality of frequency neighborhoods; 

6 b) determining, for each of the neighborhoods, if one 

7 of the frequency indices is modulated; and, 

g,, c) assigning a transmitted code value to the code 

U element if, in a majority of the neighborhoods, the respective 

IqJ modulated frequency index is an index selected for inclusion in 

lijn the audio signal. 
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ABSTRACT OF THK DISCLOSURE 

An encoder includes a sampler that samples an audio 
signal and that generates from the samples a plurality of short 
blocks of sampled audio. Each of the short blocks has a duration 
5 less than a minimum audibly perceivable signal delay. A proces- 
sor combines the plurality of short blocks into a long block. 
The long block is transformed into a frequency domain signal 
i'^ having a plurality of independently modulatable frequency indi- 
r" ces. The frequency difference between adjacent indices is 
im determined by the minimum duration and the sampling rate of the 
r sampler. A neighborhood of frequency indices is selected so that 
Iz the frequency difference between a lowest index and a highest 
?n index within the neighborhood is less than a predetermined value, 
n Two or more of the indices are modulated in the neighborhood so 
15 as to make a selected one of the indices an extremum while 

keeping the total energy of the neighborhood constant. A plural- 
ity of frequency bands are so coded. A decoder decides that a 
bit or bits have been received if, in a majority of the frequency 
bands, the decoder detects a modulated index. 
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Initialize Input Buffer with zeros. Working buffer size is 
256 samples. Initialize Out buffer with zeros. Out buffer 
size is 128 samples. 
Sub-Block Counter = 0 
Long Block Counter = 0 



Shift data in second half of Input Buffer to fu^t half 
Copy data from second half of Temporary Buffer to first 
half of Out Buffer. 



Read 128 new samples into secondhalf of Input Buffer 



Multiply Input Buffer by Window Function and store in 
Temporary Buffer, 



Perform short block FFT on Temporary Buffer data and 
compute masking level and tonality ^ 



Determine frequencies for coding based on Long Block 
Counter. Synchronization corresponds to Long Block 
Counter = 0 



If tonality is acceptable and masking level is adequate 
compute code signals for all bands. 



Add code signal to Temporary Buffer 
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Add first half of Temporaiy Buffer to Output Buffer. . J^CODED AUDIO 
Send 128 samples of en coded data out. j,,^^^^^ —j . 



Sub-Block Counter +1 

If ( Sub-Block Counter - 64 ),Long Block Counter 1 
If Long Block Counter = 17, Long Block Counter = 0 
and New Message has to be coded. 



Figure 2 
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Initialize circular buffer with Os, initialize frequency bin arrays 
with Os 
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Read 256 samples into audio buffer 
Block Sample Counter = 0 
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Insert 2 new samples into circular buffer and push 2 of the oldest 
samples into the discarded array 
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Update frequency bin arrays by adding the effect of the 2 new 
samples and eliminating the effect of the 2 old samples m the 
discarded array 
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If Block Sample Counter is a multiple of 64, analyze the 
frequency bins and store result in an appropnate Status 
Infonnation Structure(SIS). 
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If SIS contains decoded message, send message out 



Increment Block Sample Counter by 2. 
Is Block Sample Counter = 256 ? 



55 



A 



56 



YES 


1 


r 



NO 



I 



Figures 



23 



^ ^ Atty. Docket No: 28049/36241 

DECLARATION FOR PATENT APPLICATION AND POWER OF ATTORNEY 

As a below named inventor, I hereby declare that my residence, post office address and citizenship are as stated below next to 
my name; I believe that I am the onginal, first and sole inventor (if only one name is hsted below) or an original, first and joint inventor 
(if plural names are listed below) of the subject matter which is claimed and for which a patent is sought on the invention entitled 
"MULTI-BAND SPECTRAL AUDIO ENCODING", the specification of which (check one): ^ is attached hereto; □ was filed 

on as Apphcation Serial No. and was amended on (if applicable); □ 

was filed as PCX International Application No. on and was amended under Article 19 on 

(if applicable). I hereby state that I have reviewed and understand the contents of the above-identified 

specification, including the claims, as amended by any amendment(s) referred to above. I acknowledge the duty to disclose to the Patent 
and Trademark Office all information known to me to be material to patentability as defined in 37 C.F.R. §1.56. 

I hereby claim foreign priority benefits under 35 U.S.C. §119 of any foreign application(s) for patent or inventor's certificate or 
of any POT international apphcation(s) designating at least one country other than the United States of America listed below and have 
also identified below any foreign application(s) for patent or inventor's certificate or any POT international apphcation(s) designating at 
test one country other than the United States of America filed by me on the same subj ect matter having a filing date before that of the 
a|phcation(s) of which priority is claimed: 



Priority Claimed 
□ □ 



^plication Serial Number) (Country) (Day/Month/Year Filed) Yes No 

□ □ 

^Application Serial Number) (Country) (Day/Month/Year Filed) Yes No 

:;i I hereby claim the benefit under 35 U.S.C, §1 19(e) of any United States provisional application(s) listed below: 
^^ipplication Serial Number) ~ (Day/Month/Year Filed) 

(Application Serial Number) ~ (Day/Month/Year Filed) 

I hereby claim the benefit under 35 U.S.C. §120 of any United States application(s) or PCT intemational application(s) 
designating the United States of America hsted below and, insofar as the subject matter of each of the claims of this application is not 
disclosed in the prior application(s) in the manner provided by the first paragraph of 35 U.S.C. § 1 12, 1 acknowledge the duty to disclose 
to the Office all information known to me to be material to patentability as defined in 37 C.F.R. §1.56 which occurred between the filing 
date of the prior apphcation(s) and the national or PCT intemational filing date of this application: 

(Application Serial Number) (Day/Month/Year Filed) (Status-Patented, Pending or Abandoned) 

(Application Serial Number) """^ (Day/Month/ Year Filed) ' (Status-Patented, Pending or Abandoned) 



1 



I hereby declare that all statements made herein of my own knowledge are true and that all statements made on information and 
^Belief are believed to be true; and further that these statements were made with the knowledge that willful false statements and the like 
so made are punishable by fine or imprisonment, or both, under 18 U.S.C. §1001 and that such willful false statements may jeopardize 
the validity of the application or any patent issued thereon. 

POWER OF ATTORNEY: I hereby appoint as my attorneys, with full powers of substitution and revocation, to prosecute this 
application and transact all business in the Patent and Trademark Office connected therewith: 



Alvin D. Shulman (19,412) 
Owen J. Murray (22,111) 
Allen H. Gerstein (22,218) 
Nate F. Scarpelli (22,320) 
Edward M. OToole (22,477) 
Michael F. Borun (25,447) 
Trevor B, Joike (25,542) 



Carl E. Moore, Jr. (26,487) 
Richard H. Anderson (26,526) 
Patrick D. Ertel (26,877) 
James P. Zeiler (28,491) 
William E. McCracken (30,195) 
Richard A. Schnurr (30,890) 
Anthony Nimmo (30,920) 



Christine A. Dudzik (3 1 ,245) 
Kevin D. Hogg (31,839) 
Jeffreys. Sharp (31,879) 
Martin J. Hirsch (32,237) 
James J. NapoH (32,361) 
Richard M. LaBarge (32,254) 
Li-Hsien Rin-Laures (33,547) 
Douglass C- Hochstetler (33,710) 



Douglass C. Hochstetler (33,710) 
Cynthia L. Schaller (34,245) 
Robert M. Gerstein (34,824) 
David W. Clough (36,107) 
Richard A. Brandon (37,051) 
James A. Flight (37,622) 
Roger A. Heppermann (37,641) 
David A. Gass (38,153) 
Gregory C. Mayer (38,238) 



Send correspondence to: Trevor B, Joike 



'flRM NAME 

Marshall, OToole, Gerstein, 
" ::p;Murray & Borun 



PHONE NO. 



312-474-6300 



STREET 

6300 Sears Tower 
233 South Wacker Drive 



CITY & STATE 



Chicago, Illinois 



ZIP CODE 



60606-6402 



IPull Name of First or Sole Inventor 


Citizenship 


■ Wenugopal Srinivasan 


India 


'■■ : rkesidence Address - Street 


Post Office Address - Street 


2845 Jarvis Circle 


2845 Jarvis Circle 


i:$ity(Zip) 


City (Zip) 


.!falm Harbor (34683) 


Palm Harbor (34683) 


: "-State or Country 


State or Country 


; Jlorida 


Florida 


::X>ate : 


Signature \ f C 



2 



APPLICABLE RULES AND STATUTES 

V 

37CFR 1.56. DUTY OF DISCLOSURE -INFORMATION MATERIAL TO PATENTABILITY (Applicable Portion) 

(a) A patent by its very nature is affected with a public interest. The pubhc interest is best served, and the most effective 
patent examination occurs when, at the time an appHcation is being examined, the Office is aware of and evaluates the teachings of all 
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(2) the closest information over which individuals associated with the filing or prosecution of a patent application 
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is disclosed to the Office. 
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1.56(a). 

35 US.C. 102. CONDITIONS FOR PATENTABILITY: NOVELTY AND LOSS OF RIGHT TO PATENT 
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a foreign country, before the invention thereof by the applicant for patent, or 

(b) the invention was patented or described in a printed pubhcation in this or a foreign country or in public use or on 
sale in this country, more than one year prior to the date of the application for patent in the United States, or 

(c) he has abandoned the invention, or 

7: (d) the invention was first patented or caused to be patented, or was the subject of an inventor's certificate, by the 

applicant or his legal representatives or assigns in a foreign country prior to the date of the application for patent in this country on an 
igplication for patent or inventor's certificate filed more than twelve months before the filing of the apphcation in the United States, or 
i, (e) the invention was described m a patent granted on an application for patent by another filed in the United States 

|)gfore the invention thereof by the applicant for patent, or on an international apphcation by another who has fulfilled the requirements 
ofparagraph (1), (2), and (4) of section 371(c) of this title before the invention thereof by the applicant for patent, or 
r;] (f) he did not hunself invent the subject matter sought to be patented, or 

rn (g) t>efore the applicant's invention thereof the invention was made in this country by another who had not abandoned, 

^^pressed, or concealed it. In determining priority of invention there shall be considered not only the respective dates of conception and 
l^uction to practice of the invention, but also the reasonable dihgence of one who was first to conceive and last to reduce to practice, 
i&om a time prior to conception by the other. 

35 US.C 103. CONDITIONS FOR PATENTABILITY; NON-OBVIOUS SUBJECT MATTER (Applicable Portion) 

A patent may not be obtained though the invention is not identically disclosed or described as set forth in section 1 02 
of this title, if the differences between the subject matter sought to be patented and the prior art are such that the subject matter as a whole 
would have been obvious at the time the invention was made to a person having ordinary skill in the art to which said subject matter 
pertains. Patentability shall not be negatived by the manner in which the invention was made. 

Subject matter developed by another person, which qualifies as prior art only under subsection (f) or (g) of section 1 02 
of this title, shall not preclude patentability under this section where the subject matter and the claimed invention were, at the time the 
invention was made, owned by the same person or subject to an obligation of assignment to the same person. 

35 US.C. 112. SPECIFICATION (Applicable Portion) 

The specification shaU contain a written description of the invention, and of the manner and process of makmg and using 
it, in such full, clear, concise, and exact terms as to enable any person skilled in the art to which it pertains, or with which it is most nearly 
connected, to make and use the same, and shall set forth the best mode contemplated by the inventor of carrying out his invention. 
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